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Abstract 


‘The gender gap reversal in educational attainment is ubiq- 
uitous in high-income countries, as well as in a growing 
share of low- and middle-income countries. To account for 
the reversal, this paper proposes a theoretical framework in 
which the interplay between the distributions of academic 
aptitudes and changes in the net benefits of schooling over 
time affect the gender composition of those getting more 
schooling. The framework is used to formulate and test 
alternative hypotheses to explain the reversal. The paper 
introduces the tail dynamics hypothesis, which builds 


on the lower dispersion of academic achievement among 
females observed empirically. It also studies the mean 
dynamics hypothesis, which is based on previous literature. 
Both hypotheses can explain the reversal in this frame- 
work. However, the assumption behind the tail hypothesis 
is better supported by the data. Its predictions are also 
consistent with gender differences in Scholastic Achieve- 
ment Test score dynamics and in international test score 
distributions that cannot be explained by previous theories. 
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1 Introduction 


The dramatic expansion of enrollment in education worldwide over the past decades was accom- 
panied by a puzzling and striking phenomenon. As individuals acquired more schooling over time, 
females not only caught up with males, but surpassed them in educational attainment, a drastic 
shift from the historical standard. This phenomenon, sometimes referred as the gender gap rever- 
sal in education, is widespread globally. Some hypotheses have been proposed in the US literature 
to account for this reversal.' However, to the best of our knowledge, there has been no attempt to 


test these hypotheses empirically using a comprehensive framework. 


Understanding the origins of the reversal is of interest in its own right, but also for efficiency pur- 
poses. Identifying its drivers can help understanding dynamics in other areas, particularly in the 
labor market. For efficiency purposes, it is relevant to determine whether differences in observable 
outcomes between genders originate from distortions such as discrimination or, instead, from opti- 
mizing behaviors based on gender differences in preferences or traits. In particular, learning about 


the origins of the reversal could potentially help identify areas of policy intervention. 


We start by establishing that the gender gap reversal in education is widespread across high-income 
and lower-income countries, and is broader than just a reversal in participation to tertiary educa- 
tion. Using time-series on enrollment rates in education by gender in more than 140 countries, 
we evidence that the reversal occurred in virtually all high-income countries, but also in a rapidly 
growing proportion of lower-income countries. In addition, we show that the reversal occurred in 


secondary and primary school non-completion.” While males have historically constituted the mi- 


'See Goldin et al. (2006), Cho (2007), Chiappori et al. (2009) or Fortin et al. (2015). 
>The latter applies mostly to lower income countries. 


nority of secondary and primary school non-completers, they became the majority over time. 


The central contribution of the paper is to propose a framework to account for this fact and to intro- 
duce a novel explanation for the reversal. An important innovation of the theory lies in placing the 
distributions of academic aptitude at its core. The model uses the micro foundations of investment 
in schooling of Card (1994), before aggregating to the macro level. It has three building blocks. 
First, at the micro level, the optimal number of years of schooling is increasing in academic apti- 
tude. Second, the benefits of investing in higher levels of schooling are allowed to vary over time 
and to differ between genders. Third, we introduce a distributional assumption which, together 
with an increase in the economy-wide incentives to invest in education, generates either a tail dy- 
namics or a mean dynamics depending on the assumption made. We show that both dynamics can 


create the reversal observed empirically. 


A key contribution of the paper is to introduce the tail dynamics hypothesis, or tail hypothesis, 
and to show that it can explain the gender gap reversal in education. The hypothesis is novel in 
the literature and builds on the findings of Machin and Pekkarinen (2008) who report that in an 
international assessment of 15 year-old students in OECD countries, girls exhibit a lower variance 
in test scores relative to boys in virtually all countries.’ It also formalizes the intuition of Becker 
et al. (2010b), who suggest that the lower variance of non-cognitive skills among females could 
induce a higher elasticity of enrollment to returns to schooling relative to males. We show that a 


higher male variability alone can explain the reversal, independently of means. 


Using the theoretical framework we developed and country-level time-series data on educational 


There exists a large and longstanding literature in psychology on this topic, such as Frasier (1919), Hedges and 
Nowell (1995), or Jacob (2002), among many others. 


attainment by gender, we fit the tail hypothesis by estimating gender differences in the first two 
moments of test score distributions. The joint evolution of the enrollment rate and the gender ratio 
generated by the model reproduces the within-country time dynamics observed in our sample. In 
particular, it can generate the gender gap reversal observed among participants in post-secondary 


education and secondary school non-completers. 


We model the mean dynamics hypothesis or mean hypothesis, suggested by previous literature, in 
two alternative ways. The first one is the mean benefit hypothesis (MBH), which posits that the net 
benefits of schooling for females have increased more rapidly over time than for males. The second 
way to model the mean hypothesis is through the mean performance hypothesis (MPH), which 
claims that the mean performance of females in achievement tests increased relative to males’. We 


show that both hypotheses can also produce a reversal in the education gender gap. 


The mean hypotheses and the tail hypothesis rely on assumptions and generate predictions that 
are testable empirically. For the tail hypothesis, we correlate our model parameter estimates for 
gender differences in test score distributions with estimates from the Project for International Stu- 
dent Assessment (PISA) in 40 countries. We find a significant and positive correlation between 
parameters estimated from the model and estimates from the PISA reading exam data. This indi- 
cates that country dynamics in the educational gender gap resonate gender differences in test score 


distributions across countries, providing support for the tail dynamics hypothesis. 


Regarding the mean hypothesis, we test whether the relationship between test scores and enroll- 
ment in tertiary education follows the predictions of the MBH. Estimates using cross-sectional data 


in 1980 and 2002 in the US cannot be reconciled with the assumptions of the MBH. For the MPH, 


we find some evidence that the relative academic performance of females has increased in the US, 
but international evidence is rather ambiguous. We also derive the mean and the tail hypotheses’ 
predictions on the dynamics of the SAT scores in the US. This test again lends slightly more sup- 
port for the tail hypothesis. Empirically, we mostly focus on tertiary education, but the logic of the 


model extends to secondary and even primary levels. 


The paper is organized as follows. Section 2 presents the empirical facts motivating the paper. 
Section 3 lays out our theoretical framework at the micro and aggregate level. Section 4 presents 
the hypotheses for the gender gap reversal. Section 5 formulates the tail hypothesis in our the- 
oretical framework, and shows that it can produce the gender gap reversal observed empirically. 
Section 6 performs a similar exercise for the mean dynamics hypotheses and tests their underlying 
assumptions. Section 7 assesses the validity of the predictions of the hypotheses against empirical 


data. Section 8 concludes. 


2 Empirical Motivation 


2.1 Data 


The data on educational attainment used throughout the paper are from the Barro-Lee educational 
attainment dataset.* The database consists of harmonized data on educational attainment disag- 
gregated by gender every five years from 1950 to 2010, for 146 countries. It provides information 


on the distribution of educational attainment of the adult population at seven different levels of 


4See Barro and Lee (2013) and Barro and Lee (2016) for a complete description of the dataset construction and 
methodology. 


schooling: no formal education, incomplete primary, complete primary, lower secondary, upper 
secondary, incomplete tertiary, and complete tertiary. This allows to compute the participation rate 
and completion rate at various levels of education for 5-year-band birth cohorts born from 1921 
to 1990, by gender.° To describe time trends, we look at three levels of educational attainment: 
primary school non-completion, secondary school non-completion, and participation to tertiary 
education. To test the gender gap reversal hypotheses, we mostly focus on the tertiary level, but 
show some results for the secondary level. Our final sample includes 125, 135 and 115 countries 


for primary, secondary and post-secondary levels, respectively.° 


2.2 The Gender Gap Reversal in Educational Attainment 


Several contributions have reported a convergence, followed by a reversal in the number of females 
attending tertiary education relative to males for the US (Charles and Luoh (2003), Goldin et al. 
(2006), Chiappori et al. (2009), Becker et al. (2010b) or Autor and Wasserman (2013)).’ Figure 
I shows that the gender gap reversal in participation to post-secondary education is becoming 
ubiquitous globally. In a large majority of the countries in the sample (~90%), males in the earliest 
cohort outnumber females among participants to tertiary education. In the youngest cohort born 
in 1980, 64% of countries have seen their gender imbalances in post-secondary attainment revert 
over time (Table I). When averaged over all countries in the sample, the data show a reversal in 


the gender gap, reported in Figure I. The timing of the reversal varies across countries and regions. 


>For primary education, we compute non-completion rates for individuals born between 1936 to 1995, from 1931 
to 1990 for secondary school non-completion and from 1921 to 1980 for tertiary education. 

Countries were dropped from the sample due to either having missing or zero cohort/country observations, a ratio 
of female/male cohort/country tertiary enrollment observation above 5, or a cohort/country male population below 
10,000. 

7Pekkarinen (2012) has also reported this phenomenon for Scandinavian countries. 


Advanced economies, Latin America and Europe and Central Asia have seen their gender gaps 
revert to a high proportion, but other regions lag behind, such as Sub-Saharan Africa with only 8% 


of countries having experienced the reversal. 


Figure I: The Gender Gap Reversal in Educational Attainment 


Panel A: Tertiary enrollment Panel C: Primary non—completion 
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Notes. The black line represents the unweighted average for all countries in the sample. Averages are unweighted by 
country size. In each panel, countries were dropped for having missing or zero cohort/country observations, a ratio of 
female to male ratio for tertiary enrollment observation above 5, or a cohort/country male population below 10,000. 
The total sample size is 115 (Panel A), 135 (Panel B) and 125 (Panel C) countries. See Table I for a breakdown of 
sample size by country group. 

The gender gap reversal in education is not limited to tertiary education. The gender composition 
of secondary school non-completers has also reversed from female majority to male majority over 
time. Males now outnumber females among low educational achievers and this reversal is also 
ubiquitous globally. Females in the earliest cohort in the Barro-Lee database outnumber males as 
secondary-school non-completers in over 95 percent of countries in the sample. For the youngest 


cohort, the gender gap has reversed in 70% of countries. The reversal occurred in most regions and 


is also observed when statistics are averaged over all countries in the sample, as shown in Figure I 


and Table I. In Sub-Saharan Africa, only 21% of countries have experienced the reversal. 


For primary school non-completion, a reversal of the gender gap from female majority to male 
majority also took place, although patterns are not as clear cut. Country-specific results are more 
unstable overall, as in many countries, particularly high income countries, primary school non- 
completion has been rare, even in older cohorts. Panel C of Figure I shows that, when educational 
attainment is aggregated by region, most regions experienced the reversal: while the number of 
females not completing primary school was higher than males in earlier cohorts, more males than 


females do not complete primary education among younger cohorts. 


Table I: Share of Countries that Experienced the Gender Gap Reversal, by Birth Cohort 


Panel A: Tertiary education 


Birth cohort 1921 1926 1931 1936 1941 1946 1951 1956 1961 1966 1971 1976 N 
Middle East and 

North Africa 0 0 0 0 0 0 0.10 0.10 0.20 0.20 0.20 0.30 10 
Sub-Saharan Africa 0 0 0 0 0 0 0 0 0.04 0.04 0.04 0.04 24 
Advanced Economies 0 0 0 0 0 0 0 0.04 0.48 0.52 0.70 0.78 23 
Latin America and 

the Caribbean 0 0 0 0 0 0 0 0.10 0.35 0.40 0.45 0.70 20 
South Asia 0 0 0 0 0 0 0 0 0.25 0.25 0.25 0.25 4 
East Asia and 

the Pacific 0 0 0 0 0 0 0 0 0.07 0.07 0.21 0.29 «14 
Europe and 

Central Asia 0 0 0 0 0 0.05 0.25 0.35 0.65 0.70 0.85 0.90 20 
Panel B: Secondary education 

Birth cohort 1931 1936 1941 1946 1951 1956 1961 1966 1971 1976 1981 1986 N 
Middle East and 

North Africa 0 0 0 0 0.07 0.07 0.14 0.14 0.29 0.36 0.50 0.57 14 
Sub-Saharan Africa 0 0 0 0 0 0.03 0.06 0.09 0.12 0.15 0.18 0.21 33 
Advanced Economies 0 0 0 0 0.09 0.13 0.26 0.48 0.65 0.70 0.70 O78 23 
Latin America and 

the Caribbean 0 0 0.09 0.09 0.17 0.22 0.35 0.48 0.52 0.57 0.65 0.65 23 
South Asia 0 0 0 0 0 0 0.17 0.17 0.17 0.17 0.33 0.50 6 
East Asia and 

the Pacific 0 0 0 0 0 0 0.19 0.31 0.50 0.50 0.50 0.50 16 
Europe and 

Central Asia 0 0 0.10 0.10 0.15 0.15 0.30 =: 0.30 0.45 0.60 0.70 0.80 20 
Panel C: Primary education 

Birth cohort 1936 1941 1946 1951 1956 1961 1966 1971 1976 1981 1986 1991 N 
Middle East and 

North Africa 0 0 0 0 0 0 0 0 0 0 0.08 0.15 13 
Sub-Saharan Africa 0 0 0 0 0 0 0 0 0 0.03 0.03 0.13 30 
Advanced Economies 0 0 0 0 0 0 0.05 0.14 0.19 0.24 0.38 0.52 21 
Latin America and 

the Caribbean 0 0 0 0 0 0 0 0 0 0.04 0.04 0.13 23 
South Asia 0 0 0 0 0 0 0 0 0 0 0 0 5 
East Asia and 

the Pacific 0 0 0 0 0 0 0 0 0 0.14 0.14 0.14 14 
Europe and 

Central Asia 0 0 0 0 0 0 0.05 0.05 0.21 0.26 0.53 0.58 19 


Notes. Panel A. A reversal is defined to happen when the female-to-male ratio among individuals that received some 
tertiary education increases above | and does not revert back to below |. The total sample size is 115 countries. 
Panel B. A reversal is defined to take place, when the female-to-male ratio among those that did not complete 
secondary education falls below 1 and does not revert back to above 1. The total sample size is 135 countries. Panel 
C. A reversal is defined to take place, when the female-to-male ratio among those that did not complete primary 
education falls below 1 and does not revert back to above |. The total sample size is 125 countries. 

Source. Barro-Lee database 2013. 


3 Theoretical Framework 


3.1 Micro Foundations 


The economy is assumed to be populated by individuals that differ in their academic aptitude z, 
which is continuous and perfectly observed by individuals.* For the sake of simplicity, a single- 
period model is assumed in which individuals receive the benefits of their investment in education 
in the same period as they invest. Individuals choose years of schooling s to maximize their utility 


U. Building on Card (1994), we express the utility function of individuals in the economy as: 


where B(s) denotes the benefit function of schooling, with B’(s) > 0 and B”(s) < 0. C(s) is the 
cost function of schooling, which is increasing and convex in s. The first-order conditions for the 


individual maximization problem read as: 


where B’(s) and C’(s) are the marginal benefits and costs of schooling, respectively. Following 


Card (1994), we linearize the model by assuming that B’(s) and C’(s) are linear functions of s, 


8We use the term academic aptitude in the theoretical framework, but we only observe achievement test scores 
empirically. As Heckman and Kautz (2012) note, test scores are the observable product of a complex combination 
of cognitive and non-cognitive skills, as well as effort and motivation. Understanding the mapping from academic 
aptitude to test scores is beyond the scope of this paper, and we refer to the combination of cognitive and non-cognitive 
abilities captured by test scores as academic aptitude in the remainder of the paper. 


with B’(s) having an individual-specific intercept: 


B'(s) = 25 kis 


C"(s) = kgs, 


where k; > 0 and kg > 0. Intuitively, individuals with higher academic ability z; receive higher 
marginal benefits from schooling.’ In this framework, the optimal level of schooling s chosen by 
individual 7 is: 

where b = Kak is an exogenous technology parameter, which we refer to as the structural net 


benefits of schooling (hereafter: net benefits), identical for all individuals in the economy. The net 


benefits capture the monetary benefits, non-monetary benefits and costs of schooling. 


The optimal value of schooling chosen by individual 7 is therefore strictly increasing in individual 
academic aptitude z,;. In this framework, the minimum level of academic aptitude Zz so that indi- 


viduals choose a given level of schooling s, such as tertiary education, can be expressed as: 


(2) 


Equation 2 states that individuals whose academic aptitude is below the threshold Zz choose an 


optimal level of education below 5s, while individuals whose ability is equal to or greater than Z 


*In the original model of Card (1994), the expression for C’(s) also includes an individual-specific intercept, cap- 
turing individual-specific circumstances such as access to wealth and network or taste for education. For the sake of 
simplicity, we abstract from this distinction in our model. 


10 


choose 5 and above. It also implies that the ability threshold is determined by b, common to all 


individuals in a given cohort. An immediate implication of Equation 2 is: 


OZ 
ere a : 
ap = 9 


In words, the minimum level of ability required to attend a given level of schooling s, such as 


tertiary education, decreases with the net benefits of investing in schooling in the economy. 


3.2 Aggregate Enrollment Rate 


We now assume that the economy is populated by successive cohorts. Each cohort comprises a 
continuum of agents that invest in schooling and differ in their level of academic aptitude z. Let 
f.(Z) denote the probability density function of academic aptitude z in the population of a given 


cohort. The complementary cumulative distribution function of z is defined as: 


All individuals belonging to the same cohort are exposed to the same value of the exogenous 
parameter b, = ae regardless of their level of ability. Given the micro properties of the 
model expressed in Equations (1) and (2), the share of individuals in the cohort choosing a level of 


schooling at least equal to § is: 


11 


Equation (3) states that the mass of individuals choosing a level of schooling of at least 5 or higher, 
for a given value of b, is made of all individuals whose ability is above the ability threshold 2Z. 
Exogenously to individual schooling decisions, b varies across cohorts. Equation (3) implies: 


OP(2z) 
ab 


cat 


The share of individuals choosing a level of schooling equal or higher than 5 increases with the 
net benefits of schooling b. One key implication at the aggregate level is that, as the net benefits 
of education in the economy rise, enrollment increases and the mean aptitude of individuals en- 
rolled decreases. In Appendix A1.2, we show that this result is empirically supported, using US 


data. 


4 Tail and Mean Dynamics: Hypotheses for the Gender Gap 


Reversal 


We formulate three hypotheses for the gender gap reversal, summarized in Table ??. These hy- 
potheses can be grouped into two broader categories: the tail dynamics hypothesis (or tail hypoth- 
esis) and the mean dynamics hypotheses (or mean hypotheses). The validity of the assumptions 
behind these hypotheses is discussed in sections 5 and 6 and their predictions are tested in section 
7. We study the two mean hypotheses under the same category, since they are, as we demonstrate, 


very similar. 


The tail hypothesis states that G.(-) is gender specific with Var|z,] > Var|z,s|. In words, aca- 


12 


demic aptitude for males and females, z,,, and z+¢ respectively, have different distributions with a 
Pp f Tesp y 

greater variance among males. The net benefits of education b are assumed to be identical between 

genders. The increase in b (or, equivalently, the decrease in the aptitude threshold Z) over time, 


combined with the greater dispersion of aptitude among males, produces the reversal. 


The mean benefits hypothesis (MBH) claims that the net benefits of education differ between gen- 
ders, i.e. there exist gender-specific by and b,, (or, equivalently, gender-specific ability thresholds 
Zp and Z,,) that have different time dynamics over time. Prior to the reversal, by < b,, (or, equiva- 
lently Z > Zm) before by progressively converges towards b,,, and surpasses it over time, generat- 


ing the reversal. Ability distributions G'.,(-) are assumed to be identical for both genders. 


The mean performance hypothesis (MPH) claims that the mean of academic aptitude for females 
E|z,| has increased over time and progressively surpassed E|z,,,], producing the gender gap rever- 
sal. The variance of z and the net benefits of education (or, equivalently, the academic aptitude 


threshold Z) are identical for both genders. 
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Figure I: Three Hypotheses for the Gender Gap Reversal in Education 


—- males 
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Notes. Panel A: Z is the threshold of academic aptitude above which individuals enroll at a given level of schooling 
8, such as tertiary education. The grey arrow indicates the change in z over time. Panel B: The grey arrows indicate 
a faster decrease of Z, relative to Z,,, the female and male-specific ability thresholds, over time. The distribution of 
academic aptitude by gender are overlapping as they are identical under the mean benefits hypothesis. Panel C: The 
arrow pointing to the right indicate an increase in the mean of the ability distribution for females, j1, relative to males 
over time as Z decreases over time. 
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5 The Tail Dynamics Hypothesis 


5.1 The Relationship between Enrollment Rate in Education and Gender 


Composition of Individuals Enrolled 


Machin and Pekkarinen (2008) have shown that the test score distribution of males exhibits a larger 
variance than females in a large sample of OECD countries. The authors use data from the 2003 
Project for Student International Assessment (PISA) which tests nationally representative samples 
of 15 year old students from 40 countries in mathematics and reading. They report that males’ test 
score variance is strictly greater than females’ in 38 countries for mathematics and 39 countries for 
reading. The gender difference in variance is statistically significant in all but five countries with 


an average male-to-female variance ratio of 1.21 for reading and 1.20 for mathematics. 


Although this evidence is recently known to economists, extensive and long-standing evidence on 
the larger variance in the distribution of some skills among males has been reported in the psychol- 
ogy literature.!° In line with this finding, Becker et al. (2010a) and Becker et al. (2010b) suggested 
that a lower variance of non-cognitive skills among females could explain the gender gap reversal 


in education by inducing a higher elasticity of females’ schooling to returns to education. 


In our framework, let z,, and z» be random variables that denote the academic aptitude for males 
and females, and f,(Z,,) and f,(Z,) their density functions. The tail hypothesis assumes Var[z] > 
Var([zy], where the distribution of z,,, and zy is invariant over time. No assumption is imposed on 


the relative value of E|z,,| and E[z,/]. We denote 1 and o? the mean and variance of the distri- 


'0See Frasier (1919), Hedges and Nowell (1995) or Jacob (2002), among many others. 
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butions, respectively. For empirical estimations and illustrations, we assume that z,, and zy are 


normally distributed, as it is the closest approximation of empirical test score distributions. 


In our framework, the enrollment rate in a given level of schooling s among the population of a 
given cohort is given by: 


Gx (2.107) + Gx (2 fmm) 


PTH(.) = : | 


and the female-to-male ratio among individuals enrolled, denoted R(zZ), can be expressed as: 


Panel A of Figure III displays the two distributions of academic aptitudes, when O;,>07 under the 
tail hypothesis, as well as the ability threshold Z that truncates the distribution of those enrolling 
at higher levels of schooling. Individuals on the right side of the ability threshold enroll, while 
those on the left side do not. Panel B displays the two corresponding complementary cumulative 
distribution functions (CCDFs) under that assumption, where the reversal occurs when the two 


CCDFs cross. Figure IV illustrates the relationship between total enrollment rate and the gender 


Gur (2) 
Gem (2) 


ratio among individuals enrolled, where the reversal occurs when the ratio reaches 1. 


A decrease in the ability threshold for enrolling, 7, associated with an increase in the net benefits of 
education b and leading to an increase in the enrollment rate P, is required for the reversal to occur 
under the tail hypothesis. In Appendix A1.1, we report evidence showing that the mean ability of 


individuals attending tertiary education indeed decreased over time, using US data. 
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Figure III: Distribution Functions of Test Scores by Gender under the Tail Hypothesis: Illustration 


PANEL A 


> 
test-taking ability, z 
Ga) cies) 


Gi 


Notes. Panel A: The two curves show the probability density functions of academic aptitude, z, among males (full line) 
and females (dashed line), with Tin? OF under the tail dynamics hypothesis and z being normally distributed. Panel 
B: The two curves show the complementary cumulative distribution function (CCDF), resulting from the integration 
from +00 to z of f.,(z) and f,,,,(z). The grey arrow indicates a decrease in Z. The gender gap reversal in enrollment 
occurs when the ratio of the two CCDFs reaches 1, as indicated in Panel B. 


Under the assumption that o?, > OF, it can be shown that the relationship between the female-to- 
male ratio among individuals enrolled at a given level of schooling, denoted R™# and the enrolled 


rate P?" has three notable properties: 


Proposition 1. The female-to-male ratio R™ tends to zero when the total enrollment rate P™# 


tends to zero. 


Proposition 2. The female-to-male ratio R™ tends to one when the total enrollment rate P™™ 


tends to one. 
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Figure IV: Relationship between the Enrollment Rate in Tertiary Education and the 
Female-to-male Ratio among Individuals Enrolled under the Tail Hypothesis: Illustration 


G,(Z) 
Gn(Z) 


n 


T T is 
0 0.5 1 Gn(Z) + GZ) 
2 


Notes. The x-axis reports the enrollment rate. The y-axis female-to-male ratio among individuals that enroll. The 
gender gap reversal occurs when the female-to-male ratio crosses y=1. 


Proposition 3. There exists a value of P™™ € [0,1[ such that R™ = 1. This value is unique and 


always exists. 
Proof. See Appendix A4. 


The distributional assumption 07, > oF is necessary and sufficient for Proposition 1 to 3 to hold, 
without any condition imposed on the relative values of fz, and j1r. In addition, Propositions 1 
to 3 also hold when z is assumed to follow two-parameter probability distribution functions other 


than the normal distribution.!! 


5.2. Empirical Estimation 


From the Barro-Lee database, we observe the enrollment rate for a given level of education in 


country 7 and cohort t, denoted x;,, and the gender ratio among enrolled individuals, denoted y;z. 


' Proofs for distributions other than the normal are available upon request. 
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In our model P?“(-) and R7“(-), the theoretical equivalents of x; and y,,, are determined by 
the distributions of aptitude by gender, G,,, G,,, and the ability threshold z. Unlike 2,, x; is 
observable, and the two distributions are assumed to be fully characterized by the four-parameter 


vector { fm, 77,, Mp, OF}. 


Without loss of generality, the parameters of the model can be reduced to two, by normalizing one 
of the two probability density functions. We standardize the female probability density function 
such that f.,(Zie) ~ N(0,1) and denote (j1;, 07) the first two moments of the males’ distribution 


of academic aptitudes relative to females in country 2: 


Hi = Him — Mi,f = Him 


The model under the tail hypothesis predicts a unique value %;,, conditional on the triple {2}, 11;, 07}. 
To fit the model, we estimate {j:;;0?} for country i by maximum-likelihood estimation from the 


following equation: 


Gz, (Zit) 
—— : 4 4 
Vit em Ge pis o2) exp(Eix), ( ) 


where €;, ~ N(0,02). Taking the logarithm of Equation 4 and substituting 2, = P~* (xj, [1;, 07) 


yields the final econometric model: 


log yi = log Gz,(P7*(wit, 1s, 07)) — log G.,, (P~* (its Mis 07), bay 07) + Ei 
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As the error term ¢;; is normally distributed, the model can be fitted numerically by finding the 
values that minimize the sum of squared errors. Thus, the model is a non-linear mapping from x 
to y defined by the parameters of the males’ ability distribution relative to the females’ distribu- 


tion. 


Figures A3 and A4 report the estimated relationship between the enrollment rate in tertiary educa- 
tion x and the female-to-male ratio among individuals enrolled y, when {j1; 07} are fitted with the 
model under the tail hypothesis. The model generates an accurate fit for the relationship between 
x and y observed in countries in the sample.'* Figures A5 and A6 also shows our model fit for 
the relationship between the secondary school non-completion rate, and the gender ratio among 


non-completers. 


6 The Mean Dynamics Hypotheses 


6.1 The Mean Benefits Hypothesis (MBH) 


Goldin et al. (2006) and Chiappori et al. (2009) suggested that the returns from education, including 
labor market and marriage market returns, may have risen more for women over the past decades, 
which could have driven the gender gap reversal. Both contributions invoke the progressive re- 
moval of female career barriers, as a driver of increased returns. Goldin et al. (2006) associate 
the removal of barriers to female employment to a progressive change in fertility and marriage 


patterns, driven by the access to reliable contraception (Goldin and Katz (2002)). The authors 


'2Due to space limitations, we only report results for a subset of countries in the sample. Results for the full set of 
countries are available upon request. 
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posit that this raised females’ expectations regarding future labor market outcomes, increased la- 
bor force participation, and moved female employment out of traditionally female occupations. In 
a similar spirit, Chiappori et al. (2009) argue that technological progress has freed women from 
many domestic tasks, which disproportionally increased the labor market and marriage market re- 
turns of schooling for females. According to both contributions, the disproportionate increase in 


the benefits of education for females relative to males generated the reversal. 


We formulate the mean benefits hypothesis in our framework by allowing 6 to differ between 
genders, where b,,, and by denote the net benefits of education for males and females, respectively. 
by and b,,, are allowed to have different dynamics over time. G,(.) is assumed to be identical 
for males and females. Under the MBH, the optimal level of schooling chosen by individuals is 


gender-specific: 


or, equivalently, Z, is gender specific: 


(5) 


where g = {m; f}, with m standing for males and f for females. Equation 5 states that there exists 
gender-specific ability thresholds such that males and females enroll in a given level of schooling 
S, denoted Z,,, and z respectively. The enrollment rate in a given level of schooling § reads: 


Gelzany  Galezz) 


PMBH 5 


(6) 


Zfy Zin) = 


Under the MBH, a shift from b,, > by to bm < by over time, or equivalently, from Z, < Z, to 
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2m > Zy, 18 a necessary and sufficient condition for the gender gap reversal in education to occur. 


The gender ratio among individuals enrolled in a given level of education § is given by: 


Rear G.(Zs) (7) 


25, 2m) = 


and: 
BMS ey) ah iy By 
BUSES ls te oe 


Sl ae 


Using the Barro-Lee data, we calculate the parameter values of the model by solving the system of 
two equations (6 and 7) and two unknowns {2Z;,, Zm,} for each cohort t, where P”?" (Z>,, Zn.) = 
y, and RMP4(z, Z,,) = av. The estimated Z,/, and Z,,, for tertiary and secondary education, 


averaged by country group, are reported in Figure A7. 


We now assess the validity of the assumptions of the MBH. A necessary and sufficient condition 


for the reversal in the education gender gap under the MBH is: 


MBH MBH 
bas = Dry > 0 


MBH MBH 
Dintn —Ofig <9, 


where ¢; is a time prior to the reversal, and tz a time post reversal. Under the MBH, females used 
to enroll less than males conditional on test scores prior to the reversal, but now have a higher 
propensity to enroll. Under the tail hypothesis, on the other hand, there is no gender-specific time 


trend in the relationship between academic aptitude and enrollment. 


pub, 


A monetary component of the net benefits of education is the college wage premium, defined as 
the wage difference between individuals that attended tertiary education and those who did not. 
While Chiappori et al. (2009), Card and Lemieux (2000), Charles and Luoh (2003) and Trostel 
et al. (2002) find a higher college wage premium for women in recent years, this can only explain 
the reversal if it constitutes a shift compared to earlier years.'? Cho (2007) shows that trends in the 
college premium have been very similar for men and women over the period 1970—2000, using data 
from the CPS. Becker et al. (2010b) also find little difference in the premiums for men and women 
since the 1980s, with both premiums growing at about the same rate over time. Considering the 
non-monetary benefits of education, the authors do not find clear evidence for a greater marriage 
market improvement for college-educated men relative to college educated-women. They also 
show that the effect of college on life expectancy remains greater for men despite a narrowing of 


the gender gap, which cannot justify that women now enroll more in college relative to men. 


Testing these alternative hypotheses requires comparable data on test scores before and after the 
reversal that can be linked to enrollment decisions by gender in both periods. Surveys also need to 
be representative of the population of males and females in the country. According to the Barro-Lee 
time-series, the reversal in participation to tertiary education occurred in the mid-1980s in the US. 
We therefore need to observe the relationship between test scores and post-secondary enrollment 


before and after the mid 80s, with a sufficient time gap to capture time trends. 


We exploit two nationally-representative longitudinal surveys for the US: the High School and 
Beyond (HS&B), starting in 1980, and the Education Longitudinal Survey (ELS), with base year 


in 2002. Both surveys contain information on standardized mathematics and reading tests in grade 


3The methodology behind the findings of a higher college premium for females in the US has however been 
challenged by Hubbard (2011). 
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10, that can be linked to tertiary education attendance a few years later. These tests are taken at an 
age at which school is still compulsory, which ensures having a consistent estimate of average test 


scores by gender for the entire population of a specific age group. 


Using these two surveys, we estimate the empirical relationship between academic aptitude prox- 
ied by achievement test scores, and enrollment in tertiary education, separately by gender. We 
estimate a binary version of Equation | in our model, where the dependent variable is tertiary ed- 
ucation attendance, denoted H. H is observable and is a function of the latent variable s* in the 


model: 


1 if s* > 


Wal] 


H(s*) = 


WI 


0 if s* < 


We estimate (3, the net benefits to tertiary education, from the linear probability model’*: 


A; = Bgzj + €;, 


where g € {m, f} and {, is estimated in both 1980 and 2002, and z is measured by achievement 


test scores at age 15. 


Table II reports our estimates for (8, 11980, 2 ¢.z=1980) ANd (8m,z=2002, B¢,2=2002) Using a linear prob- 
ability model. The estimated Bn — 8B '¢ 1S negative in both 1980 and 2002, meaning that for females, 


academic aptitude is a better predictor of tertiary education enrollment. The benefits of education 


4This setting allows to study how well aptitude maps to educational enrollment. While our theoretical model 
produces a discrete jump in the probability to enroll from 0 to 1 above a given threshold of academic aptitude, many 
additional factors correlated or uncorrelated with aptitude are at play empirically. it is therefore as strongly simplified 
model of the determinants of enrollment in education. Empirically, we study actual probabilistic data and estimate a 
linear binary model where the likelihood to enroll increases with test scores in a continuous fashion. we believe the 
test is reasonable as we are primarily interested in general trends and signs or the relationships, rather than the exact 
magnitude of those relationships. 
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Table II: Model Estimates of the Relationship between Academic Aptitude and Participation to 
Tertiary Education in 1980 and 2002, by Gender 


Dependent Variable: Dummy 
for Tertiary Education Attendance 


1980 2002 
Males Females M-F Diff. Males Females M.-F Diff. 
Brn 0.255*** 0.414*#* 
(0.011) (0.06) 
By 0.329*** 0.479*** 
(0.010) (0.02) 
Bs — Bm -0.074*** -0.065*** 
N. observations 6,066 6,427 - 6,023 6,515 - 


Notes. ***: significant at the 1% level, **: significant at the 5% level, *: significant at the 
10% level. Standard errors are reported in parentheses. The regressions are linear probability 
models run by ordinary least squares. Regressions do not include an intercept. Test scores are 
standardized to have a mean of zero and a standard deviation of | in the full sample. 

Sources. High School and Beyond, sophomore cohort: base year (1980) and 2nd follow-up 
(1984), US Educational Longitudinal Study, sophomore cohort: base year (2002) and 2nd 
follow-up (2006) 


were higher already for females in 1980, prior to the reversal, and the estimated ee f 1S very 
stable between 1980 and 2002. If anything, there was a slight increase in b for males relative to 


females from —0.74 to —0.65. This finding is not consistent with the MBH. 


6.2 The Mean Performance Hypothesis (MPH) 


A change in the distribution of z for females relative to males over time, in particular a relative 
increase in females’ mean academic aptitude, can also generate the gender gap reversal in our 
framework. Such changes over time have been suggested as potential explanations for the reversal 
by Goldin et al. (2006) and Cho (2007) for the US. Goldin et al. (2006) use samples of high school 
graduating seniors from three US longitudinal surveys in 1957, 1972, 1988, among which the last 


two are nationally representative. Looking at achievement test scores, they find that girls reduced 
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their disadvantage in math, and increased their advantage in reading between 1972 and 1992. Using 
three nationally representative longitudinal data sets of high school students, Cho (2007) also looks 
at the evolution of female performance in high school over time. He reports that females’ mean 
high school test scores have increased more rapidly than males’ over the past three decades. Using 
a simple Oaxaca decomposition, he finds that women’s progress in high school achievement can 
account for more than half of the change in the college enrollment gender gap over the past thirty 
years. Fortin et al. (2015) use self-reported grades, rather than aptitude test scores, to look at the 
evolution of female high-school performance over time. Using a sample of 12th graders from the 
Monitoring the Future (MTF) study in 1976 and 1991, they report that the gender gap in the mean 
grades of high school seniors remained very stable since the 1970s. They however find that the 


mode of girls has shifted upwards over the period 1980-2010 compared to boys’. 


We formalize the MPH in our framework as an increase in the mean aptitude for females relative 
to males over time, which, as we illustrate, can also generate the reversal. Under the MPH, as 
under the tail hypothesis, there exists a lower bound of academic aptitude Z such that individuals 
choose a given level of schooling s. Making a normalizing assumption of /1,,, = 0 for all ¢, the 


total enrollment rate in a given level of schooling 5 is given by: 


Gan (Zt) + Gas (Up ts 2) 


Fates | , 


Lf,ts Zt) = (8) 


A change from E[zp4| < E[Zm] to Elzs1] > El zm] is a necessary and sufficient condition for the 


gender gap reversal in education to occur, when the distributions have identical variance. Under 
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the MPH, the gender ratio among individuals enrolled can be expressed as: 


Gz; Abp ts %) 


RMPH > 
(pes 2) CA) ’ 


(9) 


and: 


RMPH <1 when E[zsi] < Elzml 
RMPH = 10 when Elzee] = Elzm] 


RMPH S11 when Elzez] > Elm] 


We calculate the parameter values of the model by solving the system of two equations (8 and 9) 


Pye V hacea ad 


and two unknowns {jr +, 2,} for each period t, where a 


Lya, 2) = Y, and 
x;y. The estimated ju, and 2; for tertiary and secondary education, averaged by country group, are 


reported in Figure A8. 


To evaluate whether the academic achievement of females increased over time relative to boys for 
the entire population, we need to use a representative sample of the population of a given age group. 
This precludes the use of test score data at an age at which schooling is no longer compulsory, in 
order to avoid gender-biased estimates due to early school dropouts.'° In addition, one needs to 
rely on achievement test scores rather than grades, as grades can be subject to multiple biases when 


measuring scholastic achievement. 


We first investigate this question in the US context, by looking at the evolution of the academic ap- 
titude of 10th graders by gender over the period 1980-2002. We use two nationally-representative 


longitudinal surveys for the US, the High School and Beyond (HS&B) started in 1980 and the 


'SAs pointed out by Fortin et al. (2015), the sample of test-takers is otherwise positively selected in a way that is 
likely to be gender specific, as we showed that males are more likely to be early dropouts than females. 
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Table III: Gender Gap in Average Achievement Test score in Mathematics Prior and 
Post-reversal, US Estimates 


1980 2002 
Males Females M-F mean M-to-F Males Females M-F mean M-to-F 
difference Var. ratio difference Var. ratio 
Mean score 0.062 -0.059 0.134*** - 0.053 -0.053 0.106*** - 
Test scoresd 1.033 0.964 - 1:15*** 0.959 1.037 - 1.17*** 
Observations 6,066 6,427 - - 6,023 6,515 - - 


Notes. ***: significant at the 1% level, **: significant at the 5% level, *: significant at the 10% level. Standard errors 
are reported in parentheses. Test scores are standardized to have a mean of zero and a standard deviation of 1 in the 


full sample. 
Sources. High School and Beyond 1980 (base year) and US Educational Longitudinal Study 2002 (base year). 


Education Longitudinal Study (ELS) initiated in 2002. Both surveys contain information on stan- 
dardized test scores in mathematics in 10th grade that are comparable between the two years. The 
results are reported in Table III. The table shows that the girls’ disadvantage decreased from - 
0.134 to -0.106 standard deviation from 1980 to 2002 which is statistically significant, suggesting 


an increase in mean z for females relative to males. 


We also conduct a similar exercise for the UK at age 11, and for OECD countries using PISA test 
scores data over the past 15 years. Results are reported in Appendix A2 and show little change 
in the gender gap gap in mean achievement test score over time. Evidence about an increase in 


females’ mean performance over time is therefore ambiguous overall. 


6.3. The Similarity between the MBH and MPH 


The MPH in our framework models the gender gap reversal such that females initially have lower 
participation rates in education, due to a lower Ez]. Conditional on z, the net benefits of schooling 


and the optional level of schooling chosen are identical for both genders: E|s;-|z] = E|s;,|z]. The 
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MPH and MBH thus differ in the interpretation of the source of the dynamics in the gender mean 
differences. They are, in the way they are modeled in our framework, algebraically very similar. 
It can be shown that if academic aptitude is normally distributed, they are actually identical within 
the intersection of the domains of the two distributions (See Appendix A5). As a result, the closer 


the CCDFs of z are to linear, the more similar the two hypotheses will be in our framework. 


7 Testing Predictions of the Hypotheses 


7.1 Tail Hypothesis Model Estimates against PISA Test Score Distribution 


Parameters 


To assess the validity of the tail hypothesis, we predict the female-to-male ratio in tertiary enroll- 
ment with the mean and coefficient of variation parameter values obtained from our fit with the 
Barro—Lee data (Section 5). We predict the ratio for alternative values of the enrollment rate: 0.20, 
0.50, 0.70 and 0.90.'° We then conduct the same exercise by using instead the distribution param- 
eters from the OECD PISA test scores. We then measure the correlation between the two vectors 
of predicted female-to-male ratios for countries that are in both the Barro—Lee and PISA datasets. 


In total, 56 countries are common to the two sources. 


The intuition behind this exercise is to test whether the model under the tail hypothesis carries 
information on country-specific gender differences in test score distributions, where the PISA data 


are used as a benchmark to assess the tail hypothesis estimates. A strong positive correlation 


!6We use the coefficient of variation instead of variance to normalize the distributions. 
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Table IV: Correlations between the Gender Ratio y from the Model Estimates under the Tail 
Hypothesis and PISA Estimates, for Alternative Values of the Enrollment Rate x 


Predicted % - Model estimates 


c=0.2 4=0.5 zr=07 «£=0.9 
Predicted y - PISA Math 0.029 0.032 0.064 0.178 
(0.834) (0.817) (0.638) (0.189) 
Predicted 7 - PISA Reading 0.311** 0.349*** 0.308***  0.304** 
(0.02) (0.008) (0.002) (0.023) 
Sources. PISA 2012 and Barro-Lee database 2013. 
Notes. ***: significant at the 1% level, **: significant at the 5% level, *: significant at the 10% level. Sample size is 
56 countries. p-values are reported into parentheses. 


suggests that the dynamics of participation in education by gender carries information on gender 
differences in test scores by country, providing supportive evidence for the tail dynamics hypothe- 


sis. 


Among available datasets, the PISA dataset provides a suitable benchmark to evaluate our esti- 
mates. The PISA sample was designed to be representative of the population of 15-year-olds in a 
given country. It surveys individuals in schools before the end of compulsory education in most 
countries, which is important in our setting to obtain estimates of the entire population of males 
and females in a given age group. In addition, PISA test score estimates of the population of 


15-year-olds are comparable across a large sample of countries. 


Table IV reports the correlation between the gender ratio predicted from the two sets of parameter 
estimates, for different values of the enrollment rate. Correlations between the parameter values 
predicted by the tail hypothesis and PISA reading test are positive and statistically significant. The 
correlation is approximately 0.30 and robust to alternative values of the enrollment rate. However, 
there is no significant correlation with the math scores. The model under the tail hypothesis appears 


to capture underlying parameters of academic aptitude distributions. 
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7.2 Dynamics of Mean Ability among the Enrolled 


We test the implications of the tail and mean dynamics hypotheses by looking at the evolution of 
the mean ability of the enrolled by gender over time. As illustrated in Figure V, the tail and mean 
dynamics hypotheses generate different predictions about the relationship between the economy- 
wide enrollment rate for a given level of schooling, defined as: 


Gz, (2) Mt: Gam (2m) 


RMSE 5 


2 fs 2m) = 


and the difference in mean academic aptitude among females and males who are enrolled: 


Wee Bx) = Elzslzy > ZF = Ps |e Zan > Pires 


This originates from the fact that the tail hypothesis posits that z and z,, have different distribu- 
tions with Zf = Zm, and vice versa for the MBH. The MBH predicts that the average academic 
aptitude is initially higher for enrolled females than for enrolled males, and progressively con- 
verges towards it before taking lower values. According to the MBH, females are more heavily 
selected or self-selected to attend higher levels of schooling prior to the reversal, conditional on 
aptitude. In contrast, the tail hypothesis assumes that females are underrepresented in the tails of 
the test score distribution, but overrepresented around the mean. This implies that females’ average 


gets higher relative to males as the enrollment rate at higher levels of schooling increases."” 


To assess the validity of these predictions, we use data from several nationally representative 


"When individuals enrolling at a given level of schooling are a representative sample of the full population, the 
observed mean difference becomes an estimate of the mean difference of the population. 
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longitudinal surveys of students in the US conducted in 1972, 1980, 1990 and 2004.'® These 
surveys contain information on test scores of students while they were in secondary school that 
can be linked to post-secondary education attendance a few years later. This allows to estimate 
Elzs|zp > Z| — Elzml2m > 2m], the gender gap in average test-score at age 15 of post-secondary 


students, for different values of the enrollment rate. 


'8Those include the National Longitudinal Study 1972, the High School and Beyond 1980 and the Beginning 
Postsecondary Students Longitudinal Study (BPS) of 1990 and 2004. 
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We complement this information with yearly time-series data on mean test scores by gender from 
the Scholastic Achievement Test (SAT) from 1967 to 2010 in the US. Although these data are 
for the population of students taking the SAT test, and not only for individuals that would attend 
post-secondary education, taking the SAT is highly correlated with post-secondary education at- 
tendance. In addition, changes in gender-specific selection into SAT test-taking over time, as for 
enrollment into post-secondary education, would reflect potential gender-specific determinants of 
b and Z, which we want to test for. As illustrated in Figure V, longitudinal survey data and SAT 
time series report very similar trends on the evolution the the gender gap in mean test scores among 


the test-takers or among the enrolled in post-secondary education. 


The predictions of the two hypotheses with alternative parameter values are reported in Figure 
V. Overall, the tail hypothesis appears to perform slightly better at predicting the relationship be- 
tween the gender gap in test scores and the fraction of the population taking the test or enrolling 
into tertiary education. It predicts a slightly upward trend which is consistent with actual trends, 
except for reading test scores at low levels of enrollment. This portion shows a downward trend 
which is more consistent with the predictions of the mean dynamics hypothesis, before the trend 
reverts at higher levels of enrollment. Although evidence is not clear cut, the predictions of the 
tail hypothesis appear slightly more in line with data trends. Overall, our explanatory work con- 
ducted throughout the paper lends stronger support for the tail hypothesis, as summarized in Table 


V. 
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Figure V: Relationship between Participation to Tertiary Education and Gender Gap in Mean Test 
Score among Participants: Tail vs Mean Hypothesis (MBH) 


a Panel A: Actual trends 2 Panel B: Tail hypothesis prediction 
i/p) ep) 
£& £& 
~ OL ~ O_ 
go go 
{e) ° 
(e) oO 
wn wn 
a7) a7) 
2 2 
0) 0) 
a o_| SOR o BPSO4 a o_| 
5 2 Reading is7¥ & & S °o 
> HSNBB8I BPS90 > 
i BPS04 © 
€ Math HSNB80 mame EK [= 
8 ae g 
ec wo LLS72 = Ww 
Bo a 
= = 
Be) no} 
co) co) 
oO oO 
E E 
b oO bo 
oy oT 
i= I T T T T l i= I T T T T l 
ve 0.0 0.2 0.4 0.6 0.8 1205 42 0.0 0.2 0.4 0.6 0.8 1.0 
Share of cohort attending tertiary education Share of cohort attending tertiary education 
— Tertiaryfit — PISAmath -- PlSAread 
2 Panel C: Mean hypothesis prediction 
wn 
& 
~ oO _ 
vo 
Q 
oO 
wn 
a7) 
£ 
> oO 
fo] 
(O) 
> 
oO 
£ 
Q 
2 
= 
oO 
® 
oO 
E 
e oO 
oO 5 
€ T T 
iz 0.0 0.2 0.4 0.6 0.8 1.0 


Share of cohort attending tertiary education 


— Tertiary fit —  Lowerslope == Higher slope 


Notes. Panel A: The cross and triangle dots report the gender difference in SAT mean scores for each cohort of SAT 
takers, as a function of the share of individuals in the cohort enrolling in tertiary education, from year 1967 to 2016. 
The squared-dots represent the gender difference in mean test scores at age 15 for individuals who enrolled into tertiary 
education, from US longitudinal surveys. Panel B: Predictions of the tail hypothesis were computed using parameter 
values from the least squares fit using the Barro-Lee data. The PISA math and read curves use parameter values from 
the PISA exam scores. Panel C: The tertiary fit depicts the prediction of the mean hypothesis given the time trend 
of the female aptitude threshold Z,, estimated from time-series of the enrollment rate and the gender ratio in the US. 
Lower slope curve and higher slope curve depict the prediction if the time trend of z+ had developed at a 10% slower 
or 10% higher pace that the actual estimate, respectively. 

Sources. College Board for SAT score data. National Longitudinal Study 1972, the High School and Beyond 1980 and 
the Beginning Postsecondary Students Longitudinal Study (BPS) of 1990 and 2004 for mean achievement test scores 
at age 15 of those enrolling in tertiary education. The proportion of individuals in the cohort attending post-secondary 
education is from the US Census. 
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Table V: The Validity of the Gender Gap Reversal Hypotheses 


Hypothesis prediction 


Tail hypothesis Mean hypotheses 
MBH MPH 
Gender gap reversal v v Vv 
Predictions 
PISA test score distributions V (reading, Xx Xx 
not math) 
Mean test score dynamics of the enrolled in USA V (except early V (early - 
reading scores) reading scores) 
Assumptions 
Higher male variance in academic aptitude v Xx Xx 
Non-increase in relative female net benefits to educationin USA Vv x v 
Non-increase in relative female performance internationally v v x 


Notes. ¥ indicates consistency of the prediction of assumption with what is observed in the data. X indicates 
inconsistency with what is observed in the data. 


8 Conclusion 


The origins of the gender gap reversal in education are not fully understood. This paper con- 
tributes to a better understanding of the forces behind this reversal, by modeling alternative hy- 
potheses using a unifying theoretical framework. Importantly, the paper introduced a hypothesis 
for the reversal that has not been explored in the literature. Formulating the tail hypothesis in our 
framework allowed to reconcile two facts observed internationally: the larger variance of men’s 
performance in achievement test scores found in virtually all OECD countries, and the gender gap 
reversal in educational attainment observed in virtually all high-income countries as well as in low 


and middle-income countries. 


We also formalized two types of mean hypotheses, suggested by previous literature. The mean 


benefits hypothesis explained the gender gap reversal by allowing the female net benefits of educa- 
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tion to first catch up and subsequently overtake those of males’. The mean performance hypothesis, 
in turn, posited that a similar dynamic in mean aptitude by gender can account for the reversal. We 


showed that these two mean hypotheses are very similar in nature. 


Since the tail and mean hypotheses can both explain the reversal, we further assessed the un- 
derlying assumptions and predictions of the two hypotheses against the data. We found that the 
assumptions of the tail dynamics hypothesis are well supported, as gender differences in test score 
distribution by country estimated from our model correlate with estimates from the Project for 
International Student Assessment (PISA). Such correlations cannot be explained by previous hy- 
potheses on the gender gap reversal. In contrast, we found limited support for the underlying 


assumptions and predictions of the mean dynamics hypotheses. 


Those findings indicate that the lower variance in test score achievement among females was a 
driver of the observed gender gap reversal in education. The larger variability of men’s achieve- 
ment test scores observed empirically remains mostly unexplained, and could be explored by fur- 
ther research. Our findings outline that the over-representation of boys at the bottom end of school 
performance should be of concern among policy makers. Delving into the origins of the over- 
representation of boys at the bottom of the achievement distribution could help addressing boys’ 
growing educational disadvantaged. This is important as early school dropouts have been linked to 
poor labor market performance, higher poverty incidence but also higher crime rates, particularly 


among males. 


This paper suggests that when looking at gender differences in observable outcomes, it is important 


to go beyond the analysis of means by looking at entire distributions. Fundamentally, when ana- 
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lyzing educational outcomes by gender, the researcher is always looking at truncated distributions. 
In such distributions, the mean is a function of the dispersion of the underlying distribution. Since 
evidence shows a higher dispersion of test scores among males, this effect needs to be accounted 
for, whenever educational achievement is discussed by gender. Our findings also suggest that the 
larger variance of traits among males may be relevant to explain gender differences in other areas. 
Building on this fact could be a direction for future research in labor economics or other fields of 


economics. 
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Appendix 


Al _ Testing the Model’s hypotheses 


Al1.1_ Increase in 6 over time 


The exogenous change in b over time is a key assumption of our model. To test this assumption 
empirically, we estimate a binary version of Equation 1, where the dependent variable is tertiary 
education attendance, denoted H. H is observable and is a function of the latent variable s* in the 
model: 

ih ifs*>5 

HS) = 

0 ifs* <5 

We estimate b, the net benefits to tertiary education in our model, from the linear probability 


model: 


Ay = Bz; + €;, 


where (3 denotes the estimate of b. According to the Barro-Lee data, the gender gap reversal in 
participation to tertiary education occurred in the mid 1980s in the US. The data used for the esti- 
mation is from another longitudinal surveys of 10th graders, the High school and Beyond (HS&B), 
which started in 1980. As for the ELS 2002, this study collects 10th grade test scores that can be 
linked to tertiary education attendance a few years later, allowing to estimate changes in the value 


of b over the 20-year period in which the reversal occurred. 


Table Al reports our estimates for b in the US in 1980 and 2002 using Equation (1) from our 
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Table Al: Model Estimates of the Relationship between Academic Aptitude and Participation to 
Tertiary Education in 1980 and 2002, using US Data 


Dep. Variable: Tertiary 
Education Attendance 
1980 2002 


B 0.278*** — 0.450*** 
(0.008) (0.005) 


Number of Observations 12,493 12,538 

Notes. ***: significant at the 1% level, **: significant at 
the 5% level, *: significant at the 10% level. Standard er- 
rors are reported in parentheses. The regressions are linear 
probability models run by ordinary least squares. Regres- 
sions do not include an intercept. Test score are standard- 
ized to have a mean of 0 and a standard deviation of | in 
the full sample. 

Sources. High School and Beyond, base year (1980) and 
2nd follow-up (1984), US Educational Longitudinal Study, 
base year (2002) and 2nd follow-up (2006). 


framework. It shows that the estimated net benefits of tertiary education increased sharply from 
1980 to 2002, providing empirical support to the second building block of our model. The null 
hypothesis of equality between bj9g9 and bao92 is strongly rejected by statistical tests. This indi- 
cates that individuals with the same level of academic aptitude are more likely to enroll in higher 
education in 2002 than they were in 1980, which is equivalent to higher net benefits of education 


in 2002 compared to 1980 in our framework. 


This evidence is consistent with a large body of literature showing that returns to education, in par- 
ticular returns to tertiary education, have increased over the past decades. Goldin and Katz (2009) 
or Acemoglu and Autor (1998) among others provide consistent evidence of a sharp increase of the 
college wage premium in the US since the beginning of the 1970s.'? Card and Lemieux (2000) also 


report an important increase in the wage premium of university graduates relative to high school 


The college wage premium is defined as the wage of college-educated workers relative to the wage of high-school 
educated workers. 


4] 


graduates in the UK and Canada over the same period. 


A1.2 The Negative Association between the Enrollment Rate and Mean Ability of Individ- 


uals Enrolled 


Given the positive association between individual ability z and optimal level of schooling s* chosen 
by individuals, a rise in 6 leads to both an increase in enrollment, and a decrease in Z, the ability 
threshold for attending a given level of schooling s. We do not observe Zz empirically, but one 
implication of a decrease in Z with no accompanying increase in the mean of the distribution is a 
decrease in the mean ability of individuals enrolled, which can be tested. To do so, we estimate the 
average verbal test score of individuals attending tertiary education in the US over the period 1975- 
2010, using data from the General Social Survey (GSS). From 1974 onwards, the GSS includes 
a short 10-item multiple choice test assessing vocabulary knowledge of respondents. A measure 
of educational attainment in years is also reported, allowing to identify individuals that attended 


post-secondary education. 


Figure Al depicts the evolution of the average verbal score of post-secondary students relative 
to the entire population, from 1975 to 2010. In 1975, the average cognitive score of students at- 
tending post-secondary education was 0.60 standard deviation higher than the average cognitive 
score of the whole population. This relative difference decreased progressively until 2005 to reach 
approximately 0.30 standard deviations in 2010. Consistent with the assumptions of the model, 
this suggests that greater access to post-secondary education, an increase in P in our framework, 


was accompanied by a decrease in the average ability of individuals enrolling in tertiary educa- 
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tion. 


Figure Al: Empirical Relationship between Enrollment Rate in Tertiary Education and Average 
Verbal Score of Participants in Tertiary Education: 1975-2010 
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Source. US General Social Survey (1975-2010). 
Notes. Dots represent point estimates. Vertical lines represent confidence intervals at the 5%-level. 


A2 Validity of the Mean Performance Hypothesis Assumptions: Interna- 


tional Evidence 


We also test the validity of the mean performance hypothesis assumptions in the UK, using two 
longitudinal surveys with base years in 1958 and 2000. The National Child Development Study 
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Table A2: Gender Gap in Average Achievement Test Score Prior and Post-reversal, UK Estimates 


Panel B: UK, Reading test scores, age 11 


Year 1969 Year 2010 
Female Male M-F mean M-to-F Female Male M-F mean M-to-F 
diff. (ins.d.) Var. ratio diff (in s.d.) Var. ratio 
Mean test score 0.034 ~~ -0.034 -0.068*** 0.034 -0.035 -0.069*** 
Test score sd 0.960 1.038 1.17*** 0.973 1.025 1.12*** 
Observations 868 868 - - 6,557 6,611 - - 


Notes. ***: significant at the 1% level, **: significant at the 5% level, *: significant at the 10% level. Standard errors 
are reported in parentheses. Test scores were standardized to have a mean of 0 and standard deviation of | in the full 
sample. 

Sources. National Child Development Study (2nd follow-up, 1969) and Millennium Cohort Study (2nd follow-up, 
2010). 


(NCDS) was initiated in 1958 and follows a cohort of individuals born in the same week of 1958 
over their lifetime. The third follow-up of the study in 1969 conducted when individual were 11 
years old includes test score results in verbal and mathematics tests. In a similar way, the Mil- 
lennium Cohort Study (MCS) initiated in 2000 follows a cohort of children born at the beginning 
of the 21st century, with a follow-up in 2010 at age 10 that reports individual scores in verbal 
achievement tests. Table A2 reports that the gender gap in mean verbal achievement at age 11 is 
very similar in 1969 and 2010, more than 40 years apart. While males were already lagging behind 
females in verbal achievement at age 10 in 1969, we still observe a similar gap in 2010. Evidence 


from the UK therefore does not seem to support the assumptions of the MPH. 


For international evidence, the Project for International Student Assessment (PISA) surveys a rep- 
resentative sample of the 15-year-old population in more than 40 countries and test results have 
been designed to be comparable over time. Unfortunately, the data is only available from 2000 on- 
wards, and therefore allows tracking relative changes in mean performance between genders only 
over the period 2000-2015. Figure A2 depicts the evolution of girls’ mean test score relative to 


boys in reading and mathematics over the period 2000-2015, for 41 countries sampled in all waves 
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PISA. It shows that while female relative average performance in reading seem to have increased 
over the period 2000-2015, females appear to do worse in mathematics relative to males in 2015 
compared to 2000. Recent international evidence is therefore mostly inconclusive regarding the 


increase of female mean performance, but the short time span covered is a limitation. 
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Figure A2: Relative Performance of Males and Females in Achievement Tests at Age 15, Over 
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Sources. PISA 2000, 2003, 2006, 2009, 2012 and 2015. 
Notes. The sample includes 31 countries that are included in all PISA waves. 
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A3_ Additional Tables and Figures 


Figure A3: Model Fit under The Tail Hypothesis: Female-to-male Ratio among Participants to 
Tertiary Education 
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Notes. The x-axis measures the enrollment rate in tertiary education for country 7. The y-axis measures the female-to- 
male ratio among individuals enrolled for country 7. The full line depicts the estimated relationship between y and x 
from our model when the ability distribution parameters {,1;; 77} are estimated by maximum likelihood. 


47 


Figure A4: Model Fit under The Tail Hypothesis: Female-to-male Ratio among Participants to 
Tertiary Education 
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Notes. The x-axis measures the enrollment rate in tertiary education for country 7. The y-axis measures the female-to- 
male ratio among individuals enrolled for country 7. The full line depicts the estimated relationship between y and x 
from our model when the ability distribution parameters {,1;; 07} are estimated by maximum likelihood. 
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Figure A5: Model Fit under the Tail Hypothesis: Gender Ratio among Secondary Non-completers 
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Notes. The x-axis measures the secondary school non-completion rate for country 7. The y-axis measures the female- 
to-male ratio among secondary school non-completers for country 7. The full line depicts the estimated relationship 
between y and x from our model when the ability distribution parameters {1;; 07} are estimated by maximum likeli- 
hood. 
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Figure A6: Model Fit under the Tail Hypothesis: Gender Ratio among Secondary Non-completers 
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Notes. The x-axis measures the secondary school non-completion rate for country 7. The y-axis measures the female- 
to-male ratio among secondary school non-completers for country 7. The full line depicts the estimated relationship 
between y and x from our model when the ability distribution parameters {j1;; 07} are estimated by maximum likeli- 
hood. 
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Figure A7: Estimated MBH Parameters, by Birth Cohort and Country Group 
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Notes. Each line represents the unweighted country group mean of the estimated model parameter value under the 
MBH. Panels A and B show the mean values of calculated z+ and Z,, under the MBH, estimated from the fraction of 
individuals that attended tertiary education and the gender ratio among those who attended, for thirteen 5-year-band 
cohorts classified by country group. The total sample size is 115 countries. Panels C and D show the mean values 
of calculated Z» and Z,,, under the MBH, estimated from the fraction of individuals that did not complete secondary 
education and the gender ratio among non-completers, for thirteen 5-year-band cohorts. The total sample size is 135 
countries. 
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Figure A8: Estimated MPH Parameter Values, by Birth Cohort and Country Group 
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Notes. Each line represents the unweighted country group mean of the estimated model parameter value under the 
MPH. Panel A. The graph shows the female mean jis, estimated from the gender ratio and enrollment rate in tertiary 
education for thirteen 5-year-band cohorts from the Barro-Lee dataset. Total sample size is 115 countries. Panel B. 
The graph shows the values of calculated Z, the ability threshold common to males and females. The total sample size 
is 115 countries. Panel C. The graph shows the values of ji, estimated from the gender ratio among non-completers 
and secondary school non-completion rates for thirteen 5-year-band cohorts from the Barro-Lee dataset. The total 
sample size is 135 countries. Panel D. The graph shows the values of Z, estimated from the same time-series as Panel 
C. Total sample size is 135 countries. 
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A4 Tail Hypothesis: Mathematical Proofs of Propositions 1 to 3 


A4.1 Normal Distributions 


Let f.(z) and f.m(z) denote the probability distribution functions of test-taking ability z for 


females and males, respectively. We assume for the sake of the argument that: 


efp~ N (uy, 0%) 


and 


em ~ N(Um, o*,), 


where 07, > 7, according to the tail dynamics hypothesis. 


Proof of Proposition 1. The female-to-male ratio R™" (Z) tends to zero when the total enrollment 


rate P™ (z) tends to zero. 


Gx (24 Gem (2) 
2 


First, it is straightforward to see that limz,,, P?"(z) = = 4° — 0, where G,(z) 
denotes the complementary cumulative distribution function (or tail distribution function) of test- 


taking ability z, defined as [."°° f.(z) dz. 


Let us now study limz_,., R?“(z). Using the analytical expression of the probability distribution 


function of the normal distribution, the ratio R’“ (Z) can be expressed as: 


(z-ny)? 
oo 2 
fe = xe Bes dz 
THis) _ a 
f (2) ae 1 (Z=um)? 
am)" 
- —=—e %m dz 
J; \/ 2702, 


Taking the integrals, one can express the ratio as: 


(rf) 
lim 


0 1 (1—erf Ee) z-sco1l—(erf[z]) 1-1 0’ 


where the second to last step follows from the fact that limz_,,, er f(Z) = 1 


Using the l’H6pital rule, we take the derivative of the denominator and the numerator to get the 


following expression: 


2 2 ra 2 2 a 22 = 22 
—_ Z— O ‘A= oO ZS 0, 
Z—>00 OF Om OF Z—>00 OF Tm? f O70, 


Z—00 OF 2 


ah a 20% — 2 moe + e,07 — 202, + Qyyor, — p5o?, 
= lim exp ae 
f m”™ f 


3 


e ae 2.2 
R 0” z - Min? Mypom 
=! ¢—o7 }-2 m «+ Qusos be =; 
Jim a cm a EG: Om} — 2MmOz + 2p Om : - 


since, by assumption under the tail dynamics hypothesis, 07, > OF, which are both positive by 
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definition. 


Proof of Proposition 2. The female-to-male ratio R™" (Z) tends to one when the total enrollment 


rate P?(zZ) tends to one. 


— Gz (2+Gem(Z) 44d 1 
= 2 ge ae 


First, it is straightforward to see that limz-sin pry P7” (Z) 


Let us now study the behavior of R7“(Z) when Z tends to —oo: 


in A in OD Es 
eect (1 —erf | 42 ) 2-00 1 — (er f [2}) a. 
where we use the fact that limz_,_. er f(z) = —l. 


Proof of Proposition 3. There exists a value of P™(Z) such that R'™™(z) = 1. This value is 


unique and always exists. 


Let us now show that given our distributional assumptions, there exists a value of z denoted 2*, 
such that the numerator and denominator are of equal value, thus the ratio is one. Again, we invoke 


the ratio: 


ng tte [29)) 
3 (1-erf |e] 


Since we know that the error function is monotonously increasing on the whole domain, R?“ (Zz) = 


1 when: 
Hs 2 2 
20° i z- [bm OF — LO. 
ae | et = He z= m f : m 
Qo2 OF Om OF — On 


This equation has a unique solution given 07, > oF. Since the support of F(Z) is the whole real 
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line, there always exists a value of Z denoted z* such that: 


=y [mF _ ce 
2 = —+—_,, 


= 
Ce 


In addition, Z* is unique given the vector of exogenous parameters {J f, [m, o%, came 


A5_ Proof that the MPH is equivalent to the MBH for uniform distribution 


within the intersection of their domains 


MBH: For each time period ¢t, let b = a+cand z ~ unif(a,b) = unif(a,a +c). Then, 
G.(Z) = i= = ie a! ae 7 


= 1- =*. Also, let (*): 2m = Z and (**): Zp = Z+ a — ay. Now, 
for Zs, %m € {a,a+c} @Z+a—-ayz,Z € {a,a+ ch, 


BMeey 


5) an OD ce (1 : 
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4c 4c 

5 G.(Zs) Zp—a ym — C 2f-G@,C Im—a 
RMS hh ie =| _ fs 

(25, 2m) GAza) Ga (at+c)-a c C ls C 
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C— ZF +Q (xx) C— 2+ af 
C—2Zm+a C22 2G 


MPH: For Z + a — ay, Z € {a,a+ c}, 


EMPH (41,2) = Gem (2) + Ga, (us, Z) = 1 1 zZ—a L4- Ae 
2 2 (at+c)—-a (ag +c) — af 
1 2% -a— 97 -—-a— 
gps EE) ig Oh aes ac 
2 2c Ac 
RO Gy 2) = Gaal?) = Zz — af _ zZ—-a ae ese zZ—-a 
- Gy,{2) (ar +c) — as (a+c)-a c Ce C 


Thus, the MBH and MPH are equivalent when test-taking ability is uniformly distributed and when 
Z+a—ay,z € {a,a+c}. The difference between the two hypotheses comes from non-linearities 


in G.(-). 


ay 


